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Abstract. The Description Logic ££ has recently drawn considerable attention since, on 
the one hand, important inference problems such as the subsumption problem are polyno- 
mial. On the other hand, ££. is used to define large biomedical ontologies. Unification in 
Description Logics has been proposed as a novel inference service that can, for example, 
be used to detect redundancies in ontologies. The main result of this paper is that unifica- 
tion in £C is decidable. More precisely, ££-unification is NP-complete, and thus has the 
same complexity as £'£-matching. We also show that, w.r.t. the unification type, ££. is 
less well-behaved: it is of type zero, which in particular implies that there are unification 
problems that have no finite complete set of unifiers. 



1. Introduction 

Description logics (DLs) [6] are a family of logic-based knowledge representation for- 
malisms, which can be used to represent the conceptual knowledge of an application domain 
in a structured and formally well-understood way. They are employed in various application 
domains, such as natural language processing, configuration of technical systems, databases, 
and biomedical ontologies, but their most notable success so far is the adoption of the DL- 
based language OWL [20j as standard ontology language for the semantic web. 

In DLs, concepts are formally described by concept terms, i.e., expressions that are 
built from concept names (unary predicates) and role names (binary predicates) using con- 
cept constructors. The expressivity of a particular DL is determined by which concept 
constructors are available in it. From a semantic point of view, concept names and concept 
terms represent sets of individuals, whereas roles represent binary relations between indi- 
viduals. For example, using the concept name Woman, and the role name child, the concept 
of women having a daughter can be represented by the concept term 

Woman □ 3 child. Woman, 

and the concept of women having only daughters by 

Woman □ Vchild. Woman. 
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Knowledge representation systems based on DLs provide their users with various inference 
services that allow them to deduce implicit knowledge from the explicitly represented knowl- 
edge. An important inference problem solved by DL systems is the subsumption problem: 
the subsumption algorithm allows one to determine subconcept-superconcept relationships. 
For example, the concept term Woman subsumes the concept term Woman □ 3 child.Woman 
since all instances of the second term are also instances of the first term, i.e., the second 
term is always interpreted as a subset of the first term. With the help of the subsumption 
algorithm, a newly introduced concept term can automatically be placed at the correct 
position in the hierarchy of the already existing concept terms. 

Two concept terms C, Z? are equivalent (C = D) if they subsume each other, i.e., if 
they always represent the same set of individuals. For example, the terms Vchild.Rich □ 
Vchild.Woman and Vchild. (Rich □ Woman) are equivalent since the value restriction operator 
(Vr.C) distributes over the conjunction operator (□). If we replace the value restriction 
operator by the existential restriction operator (3r.C), then this equivalence no longer 
holds. However, for this operator, we still have the equivalence 

3child.Rich n 3 child. (Woman n Rich) = 3 child. (Woman n Rich). 

The equivalence test can, for example, be used to find out whether a concept term repre- 
senting a particular notion has already been introduced, thus avoiding multiple introduction 
of the same concept into the concept hierarchy. This inference capability is very important 
if the knowledge base containing the concept terms is very large, evolves during a long time 
period, and is extended and maintained by several knowledge engineers. However, testing 
for equivalence of concepts is not always sufficient to find out whether, for a given concept 
term, there already exists another concept term in the knowledge base describing the same 
notion. On the one hand, different knowledge engineers may use different names for con- 
cepts, like Male versus Masculine. On the other hand, they may model on different levels of 
granularity. For example, assume that one knowledge engineer has defined the concept of 
men loving fast cars by the concept term 

Human □ Male fl 3 loves. Sports_car. 

A second knowledge engineer might represent this notion in a somewhat different way, e.g., 
by using the concept term 

Man n 3 loves. (Car n Fast). 

These two concept terms arc not equivalent, but they arc meant to represent the same 
concept. The two terms can obviously be made equivalent by substituting the concept 
name Sports_car in the first term by the concept term Car D Fast and the concept name 
Man in the second term by the concept term Human □ Male. This leads us to unification 
of concept term.s. i.e., the question whether two concept terms can be made equivalent by 
applying an appropriate substitution, where a substitution replaces (some of the) concept 
names by concept terms. Of course, it is not necessarily the case that unifiable concept 
terms are meant to represent the same notion. A unifiability test can, however, suggest 
to the knowledge engineer possible candidate terms. A unifier (i.e., a substitution whose 
application makes the two terms equivalent) then proposes appropriate definitions for the 
concept names. In our example, we know that, if we define Man as Human □ Male and 
Sports_car as Car □ Fast, then the concept terms Human □ Male □ 3 loves.Sports_car and 
Man n 3 loves. (Car n Fast) are equivalent w.r.t. these definitions. 
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Unification in DLs was first considered in [12] for a DL called FCq^ which has the 
concept constructors conjunction (□), value restriction {\/r.C), and the top concept (T). It 
was shown that unification in J-Cq is decidable and ExpTime-complete, i.e., given an J-Cq- 
unification problem, we can effectively decide whether it has a solution or not, but in the 
worst-case, any such decision procedure needs exponential time. This result was extended 
in [8] to a more expressive DL, which additionally has the role constructor transitive closure. 
Interestingly, the unification type of J-Cq had been determined almost a decade earlier in 
[2]. In fact, as shown in [12j, unification in J-Cq corresponds to unification modulo the 
equational theory of idempotent Abelian monoids with several homomorphisms. In [2] it 
was shown that, already for a single homomorphism, unification modulo this theory has 
unification type zero, i.e., there are unification problems for this theory that do not have 
a minimal complete set of unifiers. In particular, such unification problems cannot have a 
finite complete set of unifiers. 

In this paper, we consider unification in the DL £C The iS£-family consists of inexpres- 
sive DLs whose main distinguishing feature is that they provide their users with existential 
restrictions (3r.C) rather than value restrictions {\/r.C) as the main concept constructor 
involving roles. The core language of this family is EC, which has the top concept, conjunc- 
tion, and existential restrictions as concept constructors. This family has recently drawn 
considerable attention since, on the one hand, the subsumption problem stays tractable 
(i.e., decidable in polynomial time) in situations where J-Cq, the corresponding DL with 
value restrictions, becomes intractable: subsumption between concept terms is tractable 
for both FCq and EC [25l [lOj, but allowing the use of concept definitions or even more 
expressive terminological formalisms makes J^Cq intractable [Ml IS [23l [5] , whereas it leaves 
EC tractable [U [171 IS]- On the other hand, although of limited expressive power, EC is 
nevertheless used in applications, e.g., to define biomedical ontologies. For example, both 
the large medical ontology Snomed ctQ and the Gene OntologjH can be expressed in EC, 
and the same is true for large parts of the medical ontology Galen [27] . The importance 
of EC can also be seen from the fact that the new OWL 2 standarcd contains a sub-profile 
OWL 2 EL, which is based on (an extension of) EC. 

Unification in EC has, to the best of our knowledge, not been investigated before, but 
matching (where one side of the equation(s) to be solved does not contain variables) has 
been considered in [71[2l|. In particular, it was shown in [23] that the decision problem, i.e., 
the problem of deciding whether a given iS£-matching problem has a matcher or not, is NP- 
complete. Interestingly, FCq behaves better w.r.t. matching than EC: for FCq, the decision 
problem is tractable [9]. In this paper, we show that, w.r.t. the unification type, FCq and 
EC behave the same: just as J-Cq, the DL EC has unification type zero. However, w.r.t. 
the decision problem, EC behaves much better than J-Cq: f/I-unification is NP-complete, 
and thus has the same complexity as f^/3-matching. 

Regarding unification in DLs that are more expressive than EC and J-Cq, one must 
look at the literature on unification in modal logics. It is well-known that there is a close 
connection between modal logics and DLs [6]. For example, the DL ACC, which can be 
obtained by adding negation to EC or FCq, corresponds to the basic (multi-)modal logic K. 
Decidability of unification in K is a long-standing open problem. Recently, undecidability 
of unification in some extensions of K (for example, by the universal modality) was shown 

"'^http: / / www.ihtsdo.org/ snomed-ct / 

^http:/ /www.geneontology.org/ 

"^See http:/ /www.w3.org/TR/owl2-profiles/ 
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in [29]. The undecidability results in [29j also imply undecidability of unification in some 
expressive DLs (e.g., SHIQ [21j). The unification types of some modal (and related) 
logics have been determined by Ghilardi; for example in [19j he shows that K4 and S4 have 
unification type finitary. Unification in sub-Boolean modal logics (i.e., modal logics that are 
not closed under all Boolean operations, such as the modal logics corresponding to £C and 
TCq) has, to the best of our knowledge, not been considered in the modal logic literature. 

In addition to unification of concept terms as introduced until now, we will also consider 
unification w.r.t. a so-called acyclic TBox in this article. Until now, we have only talked 
about concept terms, i.e., complex descriptions of concepts that are built from concept 
and role names using the concept constructors of the given DL. In applications of DLs, 
it is, of course, inconvenient to always use such complex descriptions when referring to 
concepts. For this reason, DLs are usually also equipped with a terminological formalism. 
In its simplest form, this formalism allows to introduce abbreviations for concept terms. 
For example, the two concept definitions 

Mother = Woman n 3 child. Human and Woman = Human □ Female 

introduce the abbreviation Woman for the concept term Human □ Female and the abbre- 
viation Mother for the concept term Human □ Female □ 3 child. Human. A finite set of such 
concept definitions is called an acyclic TBox if it is unambiguous (i.e., every concept name 
occurs at most once as left-hand side) and acyclic (i.e., there are no cyclic dependencies 
between concept definitions). These restrictions ensure that every defined concept (i.e., 
concept name occurring on the left-hand side of a definition) has a unique expansion to a 
concept term that it abbreviates. Inference problems like subsumption and unification can 
also be considered w.r.t. such acyclic TBoxes. As mentioned above, the complexity of the 
subsumption problem increases for the DL J-Cq if acyclic TBoxes are taken into account 
|26j . In contrast, for £C, the complexity of the subsumption problem stays polynomial in 
the presence of acyclic TBoxes. We show that, for unification in £C, adding acyclic TBoxes 
is also harmless, i.e., unification in £C w.r.t. acyclic TBoxes is also NP-complete. 

This article is structured as follows. In the next section, we define the DL £C and 
unification in £C more formally. In Section [3l we recall the characterization of subsumption 
and equivalence in £C from [23] , and in Section U] we use this to show that unification 
in £C has type zero. In Section [5l we show that unification in £C is NP-complete. The 
unification algorithm establishing the complexity upper bound is a typical "guess and then 
test" NP-algorithm, and thus it is unlikely that a direct implementation of this algorithm 
will perform well in practice. In Section [6l we introduce a more goal-oriented unification 
algorithm for £C, in which non-deterministic decisions are only made if they are triggered 
by "unsolved parts" of the unification problem. In Section \7\ we point out that our results 
for f jC-unification imply that unification modulo the equational theory of semilattices with 
monotone operators [28j is NP-complete and of unification type zero. 

More information about Description Logics can be found in [6], and about unification 
theory in [16]. This article is an extended version of a paper [11] published in the pro- 
ceedings of the 20th international Conference on Rewriting Techniques and applications 
(RTA'09). In addition to giving more detailed proofs, we have added the goal-oriented 
unification algorithm (Section [6]) and the treatment of unification modulo acyclic TBoxes 
(Subsection ES]). 
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concept name 


A 


A^ <^Vx 


role name 


r 




top-concept 


T 


T^ = Vx 


conjunction 


CnD 


(C n Df = C^nD^ 


existential restriction 


3r.C 


(3 r.C)^ = {x \3y : {x,y) e Ay e C^} 


subsumption 


CQD 




equivalence 


C = D 





Table 1: Syntax and semantics of £C 



2. Unification in £C 

In this section, we first define the syntax and semantics of £^>C-concept terms as well as 
the subsumption and the equivalence relation on these terms. Then, we introduce unification 
of £^>C-concept terms, and finally extend this notion to unification modulo an acyclic TBox. 

2.1. The Description Logic £C. Starting with a set Neon of concept names and a set 
Nroie of role names, EC-concept terms are built using the following concept constructors: 
the nullary constructor top-concept (T), the binary constructor conjunction (C □ D), and 
for every role name r G N^oie, the unary constructor existential restriction (Br.C). The 
semantics of £C is defined in the usual way, using the notion of an interpretation I = 
{T>Xi which consists of a nonempty domain Vx and an interpretation function that 
assigns binary relations on T>x to role names and subsets of Dx to concept terms, as shown 
in the semantics column of Table [TJ 

The concept term C is subsumed by the concept term D (written C 1^ D) iff C 
holds for all interpretations X. We say that C is equivalent to D (written C = D) iff 
C ^ D and D ^ i.e., iff = holds for all interpretations X. The concept term 
C is strictly subsumed by the concept term D (written C \Z D) \S C ^ D and C ^ D. 
It is well-known that subsumption (and thus also equivalence) of f /Z-concept terms can be 
decided in polynomial time |10j . 

2.2. Unification of concept terms. In order to define unification of concept terms, we 
first introduce the notion of a substitution operating on concept terms. To this purpose, 
we partition the set of concepts names into a set of concept variables (which may be 
replaced by substitutions) and a set Nc of concept constants (which must not be replaced by 
substitutions). Intuitively, are the concept names that have possibly been given another 
name or been specified in more detail in another concept term describing the same notion. 
The elements of Nc are the ones of which it is assumed that the same name is used by all 
knowledge engineers (e.g., standardized names in a certain domain). 

A substitution cr is a mapping from N^ into the set of all if£-concept terms. This 
mapping is extended to concept terms in the obvious way, i.e., 

• (t[A) := A for all A G A^^, 

• <t(T) :=T, 

• a{C n D) := a{C) H a{D), and 
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• a{3r.C) := 3r.a{C). 

Definition 2.1. An £C -unification problem is of the form F = {Ci =^ Di, . . . , C„ =^ Dn], 
where Ci, Di, . . . , Cn,Dn are £" /^-concept terms. The substitution cr is a unifier (or solution) 
of r iff a{Ci) = (j{Di) for z = 1, . . . , n. In this case, F is called solvable or unifiable. 

When we say that £'£-unification is decidable, then we mean that the following decision 
problem is decidable: given an £'/3-unification problem F, decide whether F is solvable or 
not. Accordingly, we say that £'>C-unification is NP-complete if this decision problem is 
NP-complete. 

In the following, we introduce some standard notions from unification theory [T6], but 
formulated for the special case of £'>C-unification rather than for an arbitrary equational 
theory. Unifiers can be compared using the instantiation preorder ^. Let F be an £C- 
unification problem, V the set of variables occurring in F, and a, 9 two unifiers of this 
problem. We define 

a ^ 6 iff there is a substitution A such that 0{X) = X{(t{X)) for all X ^ V. 

If a ^ 0, then we say that 9 is an instance of a. 

Definition 2.2. Let F be an £^£-unification problem. The set of substitutions M is called 
a complete set of unifiers for F iff it satisfies the following two properties: 

(1) every element of M is a unifier of F; 

(2) if ^ is a unifier of F, then there exists a unifier a € M such that a ^ 0. 

The set M is called a minimal complete set of unifiers for F iff it additionally satisfies 

(3) if cr, 6* e M, then a <• 6* implies a = 9. 

The unification type of a given unification problem is determined by the existence and 
cardinalitjQ of such a minimal complete set. 

Definition 2.3. Let F be an f /^-unification problem. This problem has type 

• unitary iff it has a minimal complete set of unifiers of cardinality 1; 

• finitary iff it has a finite minimal complete set of unifiers; 

• infinitary iff it has an infinite minimal complete set of unifiers; 

• zero iff it does not have a minimal complete set of unifiers. 

Note that the set of all unifiers of a given /^-unification problem is always a complete 
set of unifiers. However, this set is usually infinite and redundant (in the sense that some 
unifiers are instances of others). For a unitary or finitary f /^-unification problem, all unifiers 
can be represented by a finite complete set of unifiers, whereas for problems of type infinitary 
or zero this is no longer possible. In fact, if a problem has a finite complete set of unifiers 
M, then it also has a finite minimal complete set of unifiers, which can be obtained by 
iteratively removing redundant elements from M. For an infinite complete set of unifiers, 
this approach of removing redundant unifiers may be infinite, and the set reached in the 
limit need no longer be complete. This is what happens for problems of type zero. The 
difference between infinitary and type zero is that a unification problem of type zero cannot 
even have a non-redundant complete set of unifiers, i.e., every complete set of unifiers must 
contain different unifiers cr, 9 such that a ^ 9. More information on unification type zero 
can be found in [1]. 



It is easy to see that the cardinahty of a minimal complete set of unifiers is uniquely determined by the 
unification problem. 
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When we say that 8C has unification type zero, we mean that there exists an £C- 
unification problem that has type zero. Before we can prove in Section [5] that this is indeed 
the case, we must have a closer look at equivalence in £C in Section [3l But first, we consider 
unification modulo acyclic TBoxes. 

2.3. Unification modulo acyclic TBoxes. A concept definition is of the form A = C 
where ^ is a concept name and C is a concept term. A TBox T is a finite set of concept 
definitions such that no concept name occurs more than once on the left-hand side of a 
concept definition in T. The TBox T is called acyclic if there are no cyclic dependencies 
between its concept definitions. To be more precise, we say that the concept name A directly 
depends on the concept name B m. a, TBox T \i T contains a concept definition A = C 
and B occurs in C. Let depends on be the transitive closure of the relation directly depends 
on. Then T contains a terminological cycle if there is a concept name A that depends on 
itself. Otherwise, T is called acyclic. Given a TBox T, we call a concept name A a defined 
concept if it occurs as the left-side of a concept definition A = C m T. All other concept 
names are called primitive concepts. 

The interpretation I is a model of the TBox T iff A-^ = holds for all concept 
definitions A = C in T. Subsumption and equivalence w.r.t. a TBox are defined as follows: 
CQrD{C =r D) iff C (C^ = D^) holds for all models Z of T. 

Subsumption and equivalence w.r.t. an acyclic TBox can be reduced to subsumption 
and equivalence of concept terms (without TBox) by expanding the concept terms w.r.t. 
the TBox: given a concept term C, its expansion w.r.t. the acyclic TBox T is obtained 
by exhaustively replacing all defined concept names A occurring on the left-hand side of 
concept definitions A = C in T by their defining concept terms C . Given concept terms 
C, D, we have C C7- D iff ^ [11]. The same is true for equivalence, i.e., C =7- D 
iff = D'^. This expansion process may, however, result in an exponential blow-up 
[261 I14j . and thus this reduction of subsumption and equivalence w.r.t. an acyclic TBox 
to subsumption and equivalence without a TBox is not polynomial. Nevertheless, in £C, 
subsumption (and thus also equivalence) w.r.t. acyclic TBoxes can be decided in polynomial 
time [4j. 

In our definition of unification modulo acyclic TBoxes, we assume that all defined 
concepts are concept constants. In fact, defined concepts already have a definition in the 
given TBox, and thus it does not make sense to introduce new ones for them by unification. 
In this setting, a substitution a is a mapping from A'^ into the set of all i?£-concept terms 
not containing any defined concepts^ The extension of a to concept terms is defined as in 
the previous subsection, and its application to T is defined as 

a{T) := {A = a{C) \A = Cer}. 

Definition 2.4. An EC-unification problem modulo an acyclic TBox is of the form T = 
{Ci =^ Di, . . . ,Cn =7- Dn}, where Ci, Di, . . . , C„, Dn are f £-concept terms, and T is an 
acyclic SC-TBox. The substitution cr is a unifier (or solution) of F modulo Tiff o"(Cj) =a{T) 
cr(Di) for i = 1, . . . ,n. In this case, T is called solvable modulo T or unifiable modulo T. 

Coming back to our example from the introduction, assume that one knowledge engineer 
has written the concept definition 

ReaLman = Human □ Male □ 3 loves.Sports_car. 



'This restriction prevents the unifier from introducing cycles into the TBox. 
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to the TBox, whereas a second one has written the definition 

Stupid_man = Man □ 3 loves. (Car □ Fast), 

where all the concept names occurring on the left-hand side of these definitions are primitive 
concepts. Then the substitution that replaces Sports_car by CarFl Fast and Man by Human □ 
Male is a unifier of {ReaLman =^ Stupicl_man} w.r.t. the TBox T consisting of these two 
definitions. 

Using expansion, we can reduce unification modulo an acyclic TBox to unification 
without a TBox. In fact, the following lemma is an easy consequence of the fact that 
a{C^) = a{CY^'^'^ holds for all ^/I-concept terms C. 

Lemma 2.5. The substitution a is a unifier of {Ci =j- Di, . . . ,Cn =j- Dn} modulo T iff 
it is a unifier of {C^ =' D]^ , . . . , -C^}. 

Since expansion can cause an exponential blow-up, this is not a polynomial reduction. 
In the remainder of this subsection, we show that there actually exists a polynomial-time 
reduction of unification modulo an acyclic TBox to unification without a TBox. 

We say that the £'£-unification problem F is in dag-solved form if it can be written as 
r = {Xi =^ Ci, . . . , Xn =^ Cn}, where Xi, . . . , Xn are distinct concept variables such that, 
for all i < n, Xi does not occur in Ci, . . . , C„. For i = 1, ... ,n, let ai be the substitution 
that maps Xi to Ci and leaves all other variables unchanged. We define the substitution 
or as 

ariX,) ■.= ani---i(TiiXi))---) 
for i = 1, . . . , n, and ar{X) := X for all other variables X. The following is an instance of 
a well-known fact from unification theory |22j. 

Lemma 2.6. Let F = {Xi =^ Ci, • • • ,Xn =^ C„} be an EC-unification problem in dag- 
solved form. Then, the set {or} is a complete set of unifiers for F. 

There is a close relationship between acyclic TBoxes and unification problems in dag- 
solved form. In fact, if T is an acyclic TBox, then there is an enumeration Ai,... ,An of 
the defined concepts in T such that T = {^i = Ci, . . . ,An = Cn} and Ai does not occur 
in Cj, . . . , C„. Consequently, the corresponding unification problem 

F(r) := {^1 ='Ci,...,An =' Cn} 

(where Ai, . . . , An are now viewed as concept variables) is in dag-solved form. In addition, 
it is easy to see that, for any £^>C-concept term C, we have C^ = (Tr(7-)(C). 

Lemma 2.7. The EC-unification problem F = {Ci =^ Di, . . . ,Cn =7- Dn} is solvable 
modulo the acyclic TBox T iff {Ci Di, . . . , C„ =^ Dn} U F(T) is solvable^ 

Proof. Assume that is a unifier of F = {Ci =j- Di, . . . , C„ =j- Dn} modulo T. Then it 
is a unifier of F := {C]' =^ DJ, . . . ,Cn =^ D^}, by Lemma [231 Since Cj^ = OT{T){Ci) 

and DJ = (Tr(7-)(I?i), we have F = {oT{T)i^i) '^r(T)(^«) I 1 ^ ^ ^ Consequently, if 
we define the substitution r by setting t{X) := 0((Tr(7-)(^)) for all concept variables and 
defined concepts X, then r is a unifier of {Ci =^ Di, . . . , C„ =^ Dn}- In addition, since 
<Tr{T) is a unifier of F(T), r is also a unifier of {Ci Di, . . . , C„ =^ D^} U F(T). 



Note that the defined concepts of T are treated as concept constants in F, and as concept variables in 

{Ci Di,...,Cn =' D„}ur(r). 
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Conversely, assume that r is a unifier of {Ci =■ Di, . . . ,Cn =' Dn} U r(T). In partic- 
ular, this implies that r is a unifier of r(7'). By Lemma 12.61 {o'r(T)} ^ complete set of 
unifiers for T(T), and thus there is a substitution 6 such that t{X) = 9{aj-(^-j-j{X)) for all 
concept variables occurring in the unification problem {Ci Di, . . . ,C„ =^ Dn} U r(T). 
Since C/" = o"r(7-)(Cj) and DJ^ = aY(j-){Di)^ this implies that is a unifier of {Cj^ 
Dj,...,Cn D]^}, and thus of F = {Ci =^ Di, . . . ,Cn =j- Dn} modulo T, by 
Lemma 12.51 □ 

Since the size of r(T) is basically the same as the size of T, the size of F U F(7') is 
linear in the size of F and T- Thus, the above lemma provides us with a polynomial-time 
reduction of f /^-unification w.r.t. acyclic TBoxes to (J /^-unification. 

Theorem 2.8. EC-unification w.r.t. acyclic TBoxes can be reduced in polynomial time to 
EC-unification. 



3. Equivalence and subsumption in EC 

In order to characterize equivalence of EC-concept terms, the notion of a reduced EC- 
concept term is introduced in [24j . A given f/2-concept term can be transformed into an 
equivalent reduced term by applying the following rules modulo associativity and commu- 
tativity of conjunction: 

C n T — > C for all £^/2-concept terms C 

An A ^ A for all concept names A £ Neon 

3r.C n3r.D ^ 3r.C for all f /^-concept terms C, D with C Q D 

Obviously, these rules are equivalence preserving. We say that the (?£-concept term D is 
reduced if none of the above rules is applicable to it (modulo associativity and commutativity 
of n), and that C can be reduced to D \i D can be obtained from C by applying the above 
rules (modulo associativity and commutativity of □). The £^/2-concept term D \s a reduced 
form of C if C can be reduced to D and D is reduced. The following theorem is an easy 
consequence of Theorem 6.3.1 on page 181 of [24J. 

Theorem 3.1. LetC,D be EC-concept terms, andC,D reduced forms ofC,D, respectively. 
Then C = D iff C is identical to D up to associativity and commutativity o/n. 

This theorem can also be used to derive a recursive characterization of subsumption in 
EC. In fact, if C C D, then C n D = C , and thus C and C n D have the same reduced 
form. Thus, during reduction, all concept names and existential restrictions of D must be 
"eaten up" by corresponding concept names and existential restrictions of C. 

Corollary 3.2. Let C = Ai n . . . n Ak n 3ri.Ci n . . . n Br^.C^ and D = BiH . . .n BfH 
3 si.Di n . . . n 3 Sn-Dn, where ^i, . . . , A^, Bi, . . . , Bi are concept names. Then C Q D iff 
{Bi, . . . , Bi} C {Ai, . . . , Afc} and for every j, 1 < j < n, there exists an i,l < i < m, such 
that Ti = Sj and Ci Q Dj . 

Note that this corollary also covers the cases where some of the numbers k, I, m, n are 
zero. The empty conjunction should then be read as T. The following lemma, which is an 
immediate consequence of this corollary, will be used in our proof that EC has unification 
type zero. 
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Lemma 3.3. IfC,D are reduced EC-concept terms such that 3r.D Q C, then C is either 
T, or of the form C = 3r.Ci □ . . . □ 3r.C„ where n > 1; Ci, . . . ,C„ are reduced and 
pairwise incomparable w.r.t. subsumption; and D C Ci,...,D C C„. Conversely, ifC,D 
are SC-concept terms such that C = 3 r.Ci □ . . . □ 3 r.C„ and D C Ci, . . . ,D Q Cn, then 
3r.D □ C. 

The following lemma states several other obvious consequences of Corollarv l3.2[ 
Lemma 3.4. 

(1) The existential restriction 3r.C is reduced iff C is reduced. 

(2) Let Ci n . . . n Cn be concept names or existential restrictions. Then the conjunction 
Ci n . . . n Cn is reduced iff Ci, . . . ,Cn are reduced and pairwise incomparable w. r. t. 
subsumption. 

(3) Let C = Ci n . . . n Cm and D = Di n . . . n Dfi be conjunctions of EC-concept terms. 
If, for all i,l < i < n, there exists j, 1 < j < m, such that Cj Q Di, then C Q D. 
If Di, . . . ,Dn are concept names or existential restrictions, then the implication in the 
other direction also holds. 

In the proof of decidability of f /^-unification, we will make use of the fact that the 
inverse strict subsumption order is well-founded. 

Proposition 3.5. There is no infinite sequence Co, Ci, C2, C3, . . . of EC-concept terms such 
that Co C Ci C C2 C C3 C • • • . 

Proof. We define the role depth of an £'£-concept term C as the maximal nesting of exis- 
tential restrictions in C. Let no be the role depth of Cq. Since Co !^ Cj for i > 1, it is an 
easy consequence of Corollarv 13.21 that the role depth of Cj is bounded by no, and that Cj 
contains only concept and role names occurring in Co. In addition, it is known that, for a 
given natural number no and finite sets of concept names Neon and role names N^oie 1 there 
are, up to equivalence, only finitely many £'>C-concept terms built using concept names from 
C and role names from TZ and of a role depth bounded by no [15j. Consequently, there are 
indices i < j such that Cj = Cj. This contradicts our assumption that Cj C Cj. □ 

4. An .^^-unification problem of type zero 

To show that EC has unification type zero, we exhibit an £'£-unification problem that 
has this type. 

Theorem 4.1. Let X,Y be variables. The EC-unification problem T := {X □ 3r.Y =^ 
3r.y} has unification type zero. 

Proof. It is enough to show that any complete set of unifiers for this problem is redundant, 
i.e., contains two different unifiers that are comparable w.r.t. the instantiation preorder. 
Thus, let M be a complete set of unifiers for T. 

First, note that M must contain a unifier that maps X to an f>C-concept term not 
equivalent to T or 3r.T. In fact, consider a substitution r such that t{X) = 3r.A and 
r(y) = A. Obviously, r is a unifier of T. Thus, M must contain a unifier a such that a ^ t. 
In particular, this means that there is a substitution A such that 3r.A = t{X) = X{a(X)). 
Obviously, cr{X) = T would imply X{a{X)) = T, and thus 3r.A = T, which is, however. 
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not the case. Similarly, (j{X) = 3 r.T would imply \{a{X)) = 3 r.T, and thus 3 r.A = 3 r.T, 
which is also not the case. 

Thus, let (T G M be such that (j{X) ^ T and (t{X) ^ 3 r.T. Without loss of generality, 
we assume that C := ct{X) and D := cr{Y) are reduced. Since o" is a unifier of F, we have 
3 r.D C C. Consequently, Lemma 13.31 yields that C is of the form C = 3 r.Ci □ . . . □ 3 r.C„ 
where n > 1, Ci, . . . , C„ are reduced and pairwise incomparable w.r.t. subsumption, and 
DQCi,...,DrCn. 

We use a to construct a new unifier a as follows: 

a{X) := 3r.Cin...n3r.C„n3r.Z 
a(Y) := DnZ 

where Z is a new variable (i.e., one not occurring in C,D). The second part of Lemma 13.31 
implies that a is indeed a unifier of F. 

Next, we show that a ^ a. To this purpose, we consider the substitution A that 
maps Z to Ci, and does not change any of the other variables. Then we have X(jj{X)) = 
3r.Cin...n3r.C„n3r.Ci = 3 r.Cin. . .n3 r.C„ = a{X) and A(?(y)) = DnCi = D = a{Y). 
Note that the second equivalence holds since we have D Q Ci. 

Since M is complete, there exists a unifier 6 £ M such that 9 ^a. Transitivity of the 
relation ^ thus yields 6 ^ a. Since a and 9 both belong to M, we have completed the proof 
of the theorem once we have shown that a ^ 6. Assume to the contrary that a = 6. Then 
we have a ^d, and thus there exists a substitution ^ such that ^{a{X)) = d{X), i.e., 

3 r./x(Ci) n . . . n 3 r.fi[Cn) = 3 r.Ci n . . . n 3 r.C„ n 3 r.Z. (4.1) 

Recall that the concept terms Ci,...,Cn are reduced and pairwise incomparable w.r.t. 
subsumption. In addition, since (j{X) = 3 r.Ci n . . . □ 3r.C„ is reduced and not equivalent 
to 3 r.T, none of the concept terms Ci, . . . , C„ can be equivalent to T. Finally, Z is a concept 
name that does not occur in Ci, . . . , C^- All this implies that 3 r.Ci □ . . . □ 3 r.C^ n 3 r.Z is 
reduced. Obviously, any reduced form for 3r.//(Ci) □ . . . □ 3r.fi{Cn) is a conjunction of at 
most n existential restrictions. Thus, Theorem 13.11 shows that the above equivalence (14. ip 
actually cannot hold. 

To sum up, we have shown that M contains two distinct unifiers a, 9 such that 9 ^ a. 
Since M was an arbitrary complete set of unifiers for F, this shows that this unification 
problem cannot have a minimal complete set of unifiers. □ 

5. The decision problem 

Before we can describe our decision procedure for £'£-unification, we must introduce 
some notation. An £'£-concept term is called an atom iff it is a concept name (i.e., concept 
constant or concept variable) or an existential restriction 3r.DQ Obviously, any iS£-concept 
term is (equivalent to) a conjunction of atoms, where the empty conjunction is T. The set 
At{C) of atoms of an EC-concept term C is defined inductively: if C = T, then At{C) := 0; 
if C is a concept name, then At{C) := {C}; ff C = 3 r.D then At{C) := {C} U At{D); if 
C = Ci n C2, then At{C) := At{Ci) U At{C2). 

Concept names and existential restrictions 3 r.D where D is a concept name or T are 
called flat atoms. An iS£-concept term is flat iff it is a conjunction of fiat atoms (where the 
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12 



F. BAADER AND B. MORAWSKA 



empty conjunction is T). The f /^-unification problem T is flat iff it consists of equations 
between flat £'i2-concept terms. By introducing new concept variables and eliminating 
T, any £" /^-unification problem F can be transformed in polynomial time into a flat £C- 
unification problem T' such that T is solvable iff V is solvable. Thus, we may assume 
without loss of generality that our input f/2- unification problems are fiat. Given a fiat £C- 
unification problem F = {Ci =■ Di, . . . ,Cn =' Dn}, we call the atoms of Ci, Di, . . . , C„, Dn 
the atoms ofT. Atoms of F that are not variables (i.e., not elements of N^j) are called non- 
variable atoms ofT. 

The unifier a of F is called reduced iff, for all concept variables X occurring in F, the 
/^-concept term cr(X) is reduced. It is ground iff, for all concept variables X occurring in 
F, the i£'£-concept term ct(X) does not contain variables. Obviously, F is solvable iff it has 
a reduced ground unifier. Given a ground unifier a of F, the atoms of a are the atoms of 
all the concept terms where X ranges over all variables occurring in F. 

Remark 5.1. In the following, we consider situations where all occurrences of a given 
reduced atom in a reduced concept term C are replaced by a more general concept 
term, i.e., by a concept term D' with D IZ D' . However, when we say occurrence of D 
in C, we mean occurrence modulo equivalence (=) rather than syntactic occurrence. For 
example, if C = 3r.{An B) n3r.{B H A), D = 3r.{An B), and D' = 3r.A, then the term 
obtained by replacing all occurrences of -D in C by D' should be 3r.A □ 3r.A, and not 
3r.A n3r.{B n A). Since C and D are reduced, equivalence is actually the same as being 
identical up to associativity and commutativity of In particular, this means that any 
concept term that (syntactically) occurs in C and is equivalent to the atom D is also an 
atom, i.e., only atoms can be replaced by D' . In order to make this meaning of occurrence 
explicit we will call it occurrence modulo in the following. We will write Di =ac -^2 to 
express that the atoms Di and D2 are identical up to associativity and commutativity of 
n. Obviously, Di =ac D2 implies Di = D2. 

Lemma 5.2. Let C,D,D' be EC-concept terms such that D is a reduced atom, D n D' , 
and C is reduced and contains at least one occurrence of D modulo AC. If C is obtained 
from C by replacing all occurrences of D by D' , then C IZ C . 

Proof. We prove the lemma by induction on the size of C. If C =ac D, then C = D', 
and thus C = D \Z D' = C , which yields C \Z C. Thus, assume that C ^ac D. In 
this case, C cannot be a concept name since it contains the atom D. If C = Br.Ci, then 
D occurs in Ci modulo AC. By induction, we can assume that Ci IZ C(, where C( is 
obtained from Ci by replacing all occurrences of D (modulo AC) by D' . Thus, we have 
C = 3r.Cin3r.C[ = C hy Corollary [321 Finally, assume that C = Ci n . . . n forn > 1 
atoms Ci,...,Cn. Since C is reduced, these atoms are incomparable w.r.t. subsumption, 
and since the atom D occurs in C modulo we can assume without loss of generality 
that D occurs in Ci modulo AC. Let C(, . . . , be respectively obtained from Ci, . . . , 
by replacing every occurrence of D (modulo AC) by D', and then reducing the concept 
term obtained this way. By induction, we have Ci Z C{. Assume that C ^ C . Since the 
concept constructors of EC are monotone w.r.t. subsumption Z, we have C Z C", and thus 
C \^ C means that C = C . Consequently, C = Ci □ . . . Zl C„ and the reduced form of 
n . . . n must be equal up to associativity and commutativity of Zl. If Zl . . . Zl is not 
reduced, then its reduced form is actually a conjunction of m < n atoms, which contradicts 
C = C . If C( n . . . n is reduced, then Ci Z C[ implies that there is an i / 1 such that 
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Ci = C'l- However, then Cj = C( □ Ci contradicts the fact that the atoms Ci, . . . , C„ are 
incomparable w.r.t. subsumption. □ 

Proposition 13.51 says that the inverse strict subsumption order on concept terms is 
weh-founded. We use this fact to obtain a weh-founded strict order >- on ground unifiers. 

Definition 5.3. Let a^O be ground unifiers of T. We define 

(1) a y 9 iff cr{X) C holds for all variables X occurring in T. 

(2) a y 9 iS a ^ 9 and 9 ^ a, i.e., iff (t{X) C 9{X) holds for all variables X occurring in 
r, and cr{X) C 9{X) holds for at least one variable X occurring in T. 

If r contains n variables, then y is the n-fold product of the order C with itself. Since 
the strict part IZ of the inverse subsumption order C is well-founded by Proposition 13.51 the 
strict part ;^ of ^ is also well-founded [13]. The ground unifier o" of F is called is-minimal 
iff there is no ground unifier 9 oi T such that a >- 9. The following proposition is an easy 
consequence of the fact that >~ is well-founded. 

Proposition 5.4. Let T he an EC-unification problem. Then T is solvable iff it has an 
is-minimal reduced ground unifier. 

In the following, we show that is-minimal reduced ground unifiers of flat £" /^-unification 
problems satisfy properties that make it easy to check (with an NP-algorithm) whether such 
a unifier exists or not. 

Lemma 5.5. Let V he a flat EC-unification problem and 7 an is-minimal reduced ground 
unifier of T. Lf C is an atom of 7, then there is a non-variable atom D of T such that 
C^l{D). 

The main idea underlying the proof of this crucial lemma is that an atom C of a unifier 
a that violates the condition of the lemma (i.e., that is not of the form C = j{D) for a 
non- variable atom D of T) can be replaced by a concept term D such that C n D, which 
yields a unifier of T that is smaller than a w.r.t. y. 

Before proving the lemma formally, let us illustrate this idea by two examples. 

Example 5.6. First, consider the unification problem 

Fi := {3r.X HBr.A =^ 3r.X}. 

The substitution ai := {X ^ An B} is a unifier of Fi that does not satisfy the condition 
of Lemma 15.51 In fact, B is an atom of fii, but none of the non- variable atoms D of Fi 
(which are A, 3r.A, and 3r.X) satisfy B = ai{D). The unifier ai is not is-minimal since 
7i := {X I—)- A}, which can be obtained from o"i by replacing the offending atom B with 
T, is a unifier of Fi that is smaller than cri w.r.t. >-. The unifier 71 is is-minimal, and it 
clearly satisfies the condition of Lemma 15.51 
Second, consider the unification problem 

F2 :={Xn3 r.A n 3 r.B =^ X}. 

The substitution o"2 := {X 1— )• 3r.{An B)} is a unifier of F2 that does not satisfy the 
condition of Lemma 15.51 In fact, 3r.{AV^ B) is an atom of o"2, but none of the non- variable 
atoms D of F2 (which are A, B, 3r.A, and 3r.B) satisfy 3r.(A n B) = a2{D). The unifier 
(T2 is not is-minimal since 72 := {X 1-^ 3r.A □ 3r.B}, which can be obtained from (T2 
by replacing the offending atom 3r.(A □ B) with 3r.A □ 3r.B, is a unifier of F2 that is 
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smaller than a2 w.r.t. The unifier 72 is is-minimal, and it clearly satisfies the condition 
of Lemma 15.51 

Proof of Lemma \5.5[ Assume that 7 is an is-minimal reduced ground unifier of T. Since 7 
is reduced, all atoms of 7 are reduced. In particular, this implies that C is reduced, and 
since 7 is ground, we know that C is either a concept constant or an existential restriction. 

Firsts assume that C is of the form A for a concept constant A, but there is no non- 
variable atom Z) of r such that A = ^{D). This simply means that A does not appear in 
r. Let 7' be the substitution obtained from 7 by replacing every occurrence of A by T. 
Since equivalence in 8C is preserved under replacing concept names by T, and since A does 
not appear in L, it is easy to see that 7' is also a unifier of T. However, since 7 >- 7', this 
contradicts our assumption that 7 is is-minimal. 

Second^ assume that C is an existential restriction of the form Br.Ci, but there is no 
non- variable atom D oi T such that C = ^{D). We assume that C is maximal (w.r.t. 
subsumption) with this property, i.e., for every atom C of 7 with C IZ C", there is a non- 
variable atom D' of T such that C = ^{D'). Let Di, . . . , Di be all the non- variable atoms 
of r with C Q jiDi) (i = 1, . . . ,£). By our assumptions on C, we actually have C C ^{Di) 
and, by Lemma [3.31 the atom Di is also an existential restriction Di = 3r.D[ (i = 1, . . . 
We consider the conjunction 

D ■.= j{Di)n...nj{D^), 

which is T in case i = 0. 

Definition 5.7. Given an £^>C-concept term F, the concept term fI*^/^] is obtained from F 
by replacing every occurrence of C (modulo AC) hy D. The substitution obtained 
from 7 by replacing every occurrence of C (modulo AC) by D, i.e., 7[^/^](X) :=7(X)[^/^] 
for all variables X. 

We will show in the following that 7[^/^] is a unifier of F that is smaller than 7 w.r.t. 
This will then again contradict our assumption that 7 is is-minimal. 

Lemma 5.8. 7 y -yi'^/^'i. 

Proof. Obviously, D subsumes C. We claim that this subsumption relationship is actually 
strict. In fact, if £ = 0, then D = T, and since C is an atom, it is not equivalent to T. 
If £ > 1, then C = 3r.Ci □ 3r.j{D[) n . . . n 3r.j{D'i) would imply (by Corollary K2\i 
that there is an i, 1 < i < ^, with Ci □ ^{D[). However, this would yield C = 3r.Ci □ 
3r.^{D[) = 'y{Di), which contradicts the fact that C IZ j{Di). Thus, we have shown that 
C \Z D. Lemma 15.21 implies that j y 'j'. □ 

To complete the proof of Lemma 15.51 it remains to show the next lemma. 

Lemma 5.9. 7['^/^l is a unifier ofT. 

Proof. Consider an equation in T of the form Li □ . . . PI L,„ =^ i2i □ . . . □ i?„ where Li , . . . , Lm 
and Ri, . . . , Rn are fiat atoms, and define L := j{Lin. . .riLm) and R := 7(i2in. . .nRn). We 
know that L, R are conjunctions of atoms of the form L = AiH. . .Flyl^ and R = BiH. . .HBu, 
where each conjunct Ai, . . . , A^, Bi, . . . , is a reduced ground atom that is either an atom 
of 7 or equal to j{E) for a non-variable atom E of T. Since 7 is a unifier of F, we have 
L = R. 
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(1) Sincere is an atom, we obviously have LI^/^I = Af'^^ n . . . n A^^^ and R^^'^"^ = 
Bf'^^ n . . . n /^l. Now, we show that = ^^^I^\Li n . . . n L^) and ijl^/^l = 

^^'^/^\Ri n . . . n We concentrate on proving the first identity since the second 

one can be shown analogously. To show the first identity, it is enough to prove that 
7(Lj)[^/^l = 7[^/-^](Lj) holds for ah j, 1 < j < m. 

(a) If Lj is a variable X, then -i^'^/^\X) = 7(X)[^/^1 holds by the definition of 71^/^1 . 

(b) If Lj is a concept constant A, then ^I*^/^] = A since C is an existential restriction. 

Thus, we have -i\-^/^\A) = A = ^[^/^l = 7(A)[^/^1. 

(c) Otherwise, Lj is an existential restriction 3rj.L'j. By our assumption on C, we 

have C ^ 7(Lj), and thus 7(Lj)[^/-^] = Brj. (^j{Lj)^'^/^^y In addition, we have 

j^^/^^Lj) = 3 rj. 71^/^1 (L^.)- Thus, it is enough to show j{L'j)^^/^^ = -i^^'^\L'-). 
Since Lj is a flat atom, we know that V- is either a concept constant, the top-concept 

T, or a concept variable. In the first to cases, we can show 7(Lj)['^/^] = ^^^I^\L'-) 
as in (jlbp . and in the third case we can show this identity as in (jlap . 

(2) Because of ([1]), if we can prove thatL^/^l =i2[^/^],then we have shown that Y 
solves the equation Li □ . . . □ Lm i?i □ . . . □ 

Without loss of generality, we concentrate on showing that LI^Z-^I C bmce 
L^c/D] ^ j^[C/D] n . . . n ^[f/^] and = i?f n . . . n Bf'^\ it is thus sufficient 

to show that, for every i, 1 < i < u, there exists a j, 1 < j < fi, such that A^j^^^^ Q 

Bf^^^ (see (3) of Lemma [33]). Since L = Ai n . . . n Q Bi n . . . n B^ = R and 
Ai, . . . , A^, . . . , Bi, are atoms, we actually know that, for every i,l < i < i', there 
exists a j, 1 < j < /x, such that Aj C Bi. Thus, it is sufficient to show that Aj C Bi 

implies A^j^^^^ Q B^^^^"^ . This is an easy consequence of the next lemma since Ai , Bj 
satisfy the conditions of this lemma. □ 

Lemma 5.10. Let A,B be reduced ground atoms such that B is an atom of 'j or of the 
form 7(L>) for a non-variable atom D ofV. If A^ B, then AI*^/^! C B^'-^/^^ 

Proof. We show ^I'^/^l IZ i^PZ-^l by induction on the size of A. 

(1) First, assume that A =ac C, which implies that ^[^/^] = D = ^{Di) n . . . n j{Dn)- 

(a) If B is of the form B = ^{D) for a non-variable atom D of P, then there is an 
h,l < h <n, such that D = Dh, which shows that AI*^/^] □ B. Since C Q D and 
the constructors of £C are monotone w.r.t. subsumption, we also have B C ijt*^/^!, 
and thus ^!^/^] 

(b) Assume that B is an atom of 7. If B =ac C, then = D, and thus A'^'^/^^ = 
BI'^'/^\ which implies ^P/^l C B^'^/^l Otherwise, since C, B are reduced atoms, 
B C implies B ^ C. Together with C = A Q B, this shows that C n B. 
Thus, the maximality of C implies that there is a non- variable atom D of T such 
that B = 7(^0) . Thus, we are actually in case (a), which yields A^'^/^^ □ 5^/-°]. 

(2) Now, assume that A ^ac C. If there is no occurrence (modulo AC) of C in A, then 
we have A^^/^^ = A IZ B H B^^/^l 
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Otherwise, A is of the form A = 3 s.E and C occurs in E (modulo AC). Obviously, 
A ^ B then implies that B is of the form B = 3 s.F with E Q F. The concept 
terms E, F are conjunctions of reduced ground atoms, i.e., E = EiH . . .n Ef^ and F = 
Fi n . . . n Fx where Ei, . . . , E^^, Fi, . . . , Fx are reduced ground atoms. By Corollary 13.21 
for every h,l < h < X, there exists k,l < k < k, such that E^ ^ F^. 

In order to be able to assume, by induction, that E^ E Fh implies E^^^^^ C F^^^\ 
we must show that the conditions in the statement of the lemma hold for the concept 
terms Ek,Fh, where Ek plays the role of A and Fh plays the role of B. Since we already 
know that Ei, . . . , E^,Fi, . . . ,Fx are reduced ground atoms, it is sufficient to show that 
each of the atoms Fi,. . . ,F\ is an atom of 7 or of the form ^{D) for a non- variable 
atom D of T. We know that B = 3 s.{Fi n . . . n F\) is an atom of 7 or an instance 
(w.r.t. 7) of a non- variable atom of F. In the first case, the atoms Fi, . . . ,Fx are clearly 
also atoms of 7. In the second case, B = 7(-D') for a non-variable atom D' of F. If D' 
is a ground atom, then Fi, . . . ,Fx are also ground atoms that are atoms of F, and thus 
they are instances (w.r.t. 7) of non-variable atoms of F. Otherwise, since F is fiat, D' 
is of the form 3 s.X for a variable X and 'y{X) = FiH . . . n Fx- In this case, Fi, . . . ,Fx 
are clearly atoms of 7. 

Thus, we can assume by induction: 

(*) for every h,l < h < X, there exists k,l < k < k such that E^^^^"^ C f]^^^^ 
It remains to show that this implies A^'^/^'^ C B^'-'/^\ 

(a) If B +AC C, then ^I^/^l = 3 s.(i?P^l n . . . n i^F^^) and i?!^/^] = 3 s.(Ff n 

. . . n Ff^^), and thus property (*) yields A^'^l^^ C B^^'^^ 

(b) Assume that B =ac C. In this case, C cannot occur (modulo AC) in any of the 

concept terms Fi, . . . , Fh, which implies that B = 3 s.{Fi □ . . . □ Fx) = 3 s.{Ff^^^'^ □ 

. . . nF|^/^^). Since we have A^^/^^ = 3 s.{Ef'^^ n . . . n E^^'^^), property (*) yields 

Since we also have B C B^^/'^\ this yields ^l^/^l □ B^^l^\ 

Thus, we have shown in all cases that A^'^/^'^ C B^^l^\ which completes the proof of 
Lemma 15.101 □ 

Overall, we have thus completed the proof of Lemma 15.51 The next proposition is an 
easy consequence of this lemma. 

Proposition 5.11. Let T he a flat EC-unification problem and 7 an is-minimal reduced 
ground unifier o/F. If X is a concept variable occurring in T, then 7(X) = T or there are 
non-variable atoms Di, . . . , D„ (n > 1) ofT such that 7(X) = 7(-Di) □ . . . □ j{Dn)- 

Proof. If 'y{X) ^ T, then it is a non-empty conjunction of atoms, i.e., there are atoms 
Ci, . . . ,Cn {n > 1) such that ^{X) = Ci □ . . . FI Cn- Then Ci, . . . ,Cn are atoms of 7, 
and thus Lemma 15.51 vields non- variable atoms Di, . . . ,Dn of F such that Ci = ^{Di) for 
i = 1,. . .n. Consequently, j{X) = ^{Di) n . . . n j{Dn). □ 

This proposition suggests the following non-deterministic algorithm for deciding solv- 
ability of a given fiat £'>C-unification problem. 

Algorithm 5.12. Let F be a fiat £^ /^-unification problem. 
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(1) For every variable X occurring in F, guess a finite, possibly empty, set Sx of non-variable 
atoms of F. 

(2) We say that the variable X directly depends on the variable Y \iY occurs in an atom of 
Sx- Let depends on be the transitive closure of directly depends on. If there is a variable 
that depends on itself, then the algorithm returns "fail." Otherwise, there exists a strict 
linear order > on the variables occurring in F such that X > Y \{ X depends on Y . 

(3) We define the substitution a along the linear order >: 

• If X is the least variable w.r.t. >, then Sx does not contain any variables. We define 
(t(X) to be the conjunction of the elements of Sx^ where the empty conjunction is T. 

• Assume that cr{Y) is defined for all variables Y < X. Then Sx only contains variables 
Y for which (j{Y) is already defined. If Sx is empty, then we define ct{X) := T. 
Otherwise, let Sx = {Di,.. We define a{X) := a{Di) n . . . n a{Dn). 

(4) Test whether the substitution a computed in the previous step is a unifier of F. If this 
is the case, then return a; otherwise, return "fail." 

This algorithm is trivially sound since it only returns substitutions that are unifiers of 
F. In addition, it obviously always terminates. Thus, to show correctness of our algorithm, 
it is sufficient to show that it is complete. 

Lemma 5.13 (Completeness). IfT is solvable, then there is a way of guessing in Step 1 
subsets Sx of the non-variable atoms of F such that the depends on relation determined in 
Step 2 is acyclic and the substitution a computed in Step 3 is a unifier ofT. 

Proof. If F is solvable, then it has an is-minimal reduced ground unifier 7. By Proposi- 
tion [5TTT1 for every variable X occurring in F we have 'y{X) = T or there are non- variable 
atoms £>!,..., (n > 1) of F such that j{X) = j{Di) n . . . n 7(Dn)- If liX) = T, then 
we define Sx '■= 0- Otherwise, we define Sx '■= {Di, • • • , Dn}- 

We show that the relation depends on induced by these sets Sx is acyclic, i.e., there is 
no variable X such that X depends on itself. If X directly depends on Y, then Y occurs in 
an element of Sx- Since Sx consists of non- variable atoms of the fiat unification problem 
F, this means that there is a role name r such that 3r.Y G Sx- Consequently, we have 
7(X) C 3 r.j{Y). Thus, if X depends on X, then there are k > 1 role names ri, . . . , such 
that j{X) C 3ri. • • • 3rk-'y{X). This is clearly not possible since 'y{X) cannot be subsumed 
by an £^>C-concept term whose role depth is larger than the role depth of j{X). 

To show that the substitution a induced by the sets Sx is a unifier of F, we prove 
that a is equivalent to 7, i.e., cr{X) = j{X) holds for all variables X occurring in F. The 
substitution a is defined along the linear order >. If X is the least variable w.r.t. >, then 
the elements of Sx do not contain any variables. If Sx is empty, then cr{X) = T = j{X). 
Otherwise, let Sx = {Di, - - - ,Dn}- Since the atoms Di do not contain variables, we have 
Di = ■~f{Di). Thus, the definitions of Sx and of a yield = Di □ . . . □ D„ = 7(-Di) n 

...n7(Z)„)^7(X). 

Assume that cr{Y) = 'y{Y) holds for all variables Y < X. If Sx = 0, then we have 
again cr{X) = T = 7(X). Otherwise, let Sx = {Di, . . . , Dn}- Since the atoms Di contain 
only variables that are smaller than X, we have cr{Di) = j{Di) by induction. Thus, the 
definitions of Sx and of a yield cr(X) = a{Di)n. . .na{Dn) = 7(1)1) n. . .n7(D„) = 7(X).n 

Note that our proof of completeness actually shows that, up to equivalence, the algo- 
rithm returns all is-minimal reduced ground unifiers of F. 

Theorem 5.14. EC-unification is NP-complete. 
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Proof. NP-hardness follows from the fact that £'>C-matching is NP-complete [24] H To show 
that the problem can be decided by a non-deterministic polynomial-time algorithm, we 
analyze the complexity of our algorithm. Obviously, guessing the sets Sx (Step 1) can be 
done within NP. Computing the depends on relation and checking it for acyclicity (Step 2) 
is clearly polynomial. 

Steps 3 and 4 are more problematic. In fact, since a variable may occur in different 
atoms of r, the substitution a computed in Step 3 may be of exponential size. This is 
actually the same reason that makes a naive algorithm for syntactic unification compute 
an exponentially large most general unifier [16]. As in the case of syntactic unification, the 
solution to this problem is basically structure sharing. Instead of computing the substitution 
a explicitly, we view its definition as an acyclic TBox. To be more precise, for every concept 
variable X occurring in T, the TBox Ta contains the concept definition X = T if Sx = 
and X = DiH . . . \lDn if Sx = {Di, ■ ■ ■ , Dn} {n > 1). Instead of computing a in Step 3, we 
compute Ta- Because of the acyclicity test in Step 2, we know that Ta is an acyclic TBox. 
The size of Ta is obviously polynomial in the size of T, and thus this modified Step 3 is 
polynomial. 

It is easy to see that applying the substitution a to a concept term C is the same as 
expanding C w.r.t. the TBox Ta, i.e., cr{C) = C^"^ . This implies that, for every equation 
C =^ D in r, we have C =j-^ D iff ct(C) = (^[D). Thus, testing in Step 4 whether u is a 
unifier of T can be reduced to testing whether C =7^ D holds for every equation C =^ D 
in r. Since subsumption (and thus equivalence) in £C w.r.t. acyclic TBoxes can be decided 
in polynomial time [4J, this completes the proof of the theorem. □ 

In Subsection 12.31 we have shown that there exists a polynomial-time reduction of 
unification modulo an acyclic TBox to unification without a TBox. Thus, Theorem 15.141 
also yields the exact complexity for f/3-unification w.r.t. acyclic TBoxes. 

Corollary 5.15. EC-unification w.r.t. acyclic TBoxes is NP-complete. 

Proof. The problem is in NP since Theorem 12.81 states that there is a polynomial-time 
reduction of £'£-unification w.r.t. acyclic TBoxes to (?£-unification, and we have just shown 
that f £-unification is in NP. 

NP-hardness for f /^-unification w.r.t. acyclic TBoxes follows from NP-hardness of £C- 
unification since £^£-unification can be viewed as the special case of f /^-unification w.r.t. 
acyclic TBoxes where the TBox is empty. □ 



6. A GOAL-ORIENTED ALGORITHM 

The NP-algorithm introduced in the previous section is a typical "guess and then test" 
NP-algorithm, and thus it is unlikely that a direct implementation of this algorithm will 
perform well in practice. Here, we introduce a more goal-oriented unification algorithm for 
£C, in which non-deterministic decisions are only made if they are triggered by "unsolved 
parts" of the unification problem. 

As in the previous section, we assume without loss of generality that our input unifica- 
tion problem Tq is flat. For a given flat equation C =^ D, the concept terms C,D are thus 

^The NP-hardness proof in [24j is by reduction of SAT. This reduction employs two concept constants 
and four role names. However, the roles are mainly used to encode several (matching) equations into a single 
one. When using a set of equations rather than a single equation, one role name is sufficient. 
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conjunctions of flat atoms. We will often view such an equation as consisting of four sets: 
the left-hand side C is given by the set of variables occurring in the top-level conjunction of 

C, together with the set of non-variable atoms occurring in this top-level conjunction; the 
right-hand side D is given by the set of variables occurring in the top-level conjunction of 

D, together with the set of non-variable atoms occurring in this top-level conjunction. To 
be more precise, let e denote the equation C =D, where C = Xi □ . . . □ n AiH . . .n A/^ 
and D = Yi n . . . n Yn n Bi n . . . n Bi for concept variables Xi, . . . , Xm, Yi, . . . ,Yn and 
non- variable atoms Ai, . . . , A/., Bi, . . . B^. Then we define 

LVar{e) := {Xi, . . . , RVar{e) := {Yi, . . . ,Yn}, 

LAto{e) :={Ai,..., Ak}, RAto{e) := {^i, . . . B^}. 

Obviously, the equation e : C =^ D \s uniquely determined (up to associativity, commutativ- 
ity, and idempotency of conjunction) by the four sets LVar{e),LAto{e),RVar[e)^RAto[e). 
Instead of viewing an equation e as being given by a pair of concept terms, we can thus 
also view it as being given by these four sets. In the following, it will often be convenient 
to employ this representation of equations. If, with this point of view, we say that we add 
an atom to the set LAto{e) or RAto{e), then this means, for the other point of view, that 
we conjoin this atom to the top-level conjunction of the left-hand side or right-hand side of 
the equation. In addition, if we say that the equation e contains the variable X, then we 
mean that X G LVar{e) U RVar{e). Similarly, if we say that the left-hand side of e contains 
X, then we mean that X G LVar{e), and if we say that the right-hand side of e contains 
X, then we mean that X £ i?Var(e))@ 

In addition to the unification problem itself, the algorithm also maintains, for every 
variable X occurring in the input problem Tq, a set Sx of non- variable atoms of Tq. Initially, 
all the sets Sx are empty. We call the set Sx the current assignment for X, and the collection 
of all these sets the current assignment. Throughout the run of our goal-oriented algorithm, 
we will ensure that the current assignment is acyclic in the sense that no variable depends 
on itself w.r.t. this assignment (see (2) of Algorithm I5.12p . An acyclic assignment induces 
a substitution a, as defined in (3) of Algorithm 15.121 We call this substitution the current 
substitution. Initially, the current substitution maps all variables to T. 

The algorithm applies rules that can 

(1) change an equation of the unification problem by adding non- variable atoms of the input 
problem Tq to one side of the equation; 

(2) introduce a new flat equation of the form CnB = B, where C, B are atoms of the input 
problem Tq or T; 

(3) add non- variable atoms of the input problem Tq to the sets Sx- 

Another property that is maintained throughout the run of our algorithm is that all equa- 
tions e are expanded w.r.t. the current assignment in the following sense: for all variables 
X we have 

A e Sx ^X e LVar{e) ^ Ae LAto{e) and Ae Sx /\X e RVar{e) ^ Ae RAto{e). 

Given a flat equation e that contains the variable X, the expansion of e w.r.t. the assignment 
Sx for X is defined as follows: if X G LVar{e) then all elements of Sx are added to LAto{e), 
and if X G RVar{e) then all elements of Sx are added to RAto{e). 



Note that occurrences of X inside non-variable atoms 3r.X £ LAto{e) U RAto{e) are not taken into 
consideration here. 
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The L-variant of the Eager- Assignment rule applies to the equation e if there is an 
unfinished variable X € LVar{e) such that 

• all variables Z € {LVar{e) \ {X}) U RVar{e) are finished; 

• LAto{e) = 0. 

Its application sets Sx '■= RAto{e). 

(1) If this makes the current assignment cyclic, then return "fail." 

(2) Otherwise, label X as finished and expand all equations containing X w.r.t. 
the new assignment for X. 

Figure 1: The Eager- Assignment rule in its L-variant. The R- variant is obtained by ex- 
changing the roles of the two sides of the equation. 

The following lemma is an immediate consequence of the definition of expanded equa- 
tions and of the construction of the current substitution. 

Lemma 6.1. // the equation C =^ D is expanded w.r.t. the current assignment, then 
LAto{C =■ D) = RAto(C =■ D) implies that the current substitution a solves this equation, 
i.e., a{C) =a{D). 

We say that an equation e is solved if LAto(e) = RAto{e). An atom A € LAto{e) PI 
RAto{e) is called solved in e; atoms A € LAto{e) U RAto{e) that are not solved in e are 
called unsolved in e. Obviously, an equation e is solved iff all atoms A G LAto{e) U RAto(e) 
are solved in e. 

Basically, in each step, the goal-oriented algorithm considers an unsolved equation and 
an unsolved atom in this equation, and tries to solve it. Picking the unsolved equation and 
the unsolved atom in it is don't care non-deterministic, i.e., there is no need to backtrack 
over such a choice. Once an unsolved equation and an unsolved atom in it was picked, 
don't know non-determinism comes in since there may be several possibilities for how to 
solve this atom in the equation, some of which may lead to overall success whereas others 
won't. In some cases, however, a given equation uniquely determines the assignment for a 
certain variable X. In this case, we make this assignment and then label the variable X as 
finished. This has the effect that the set Sx can no longer be extended. Initially, none of 
the variables occurring in the input unification problem is labeled as finished. We say that 
the variable X is unfinished if it is not labeled as finished. 

Algorithm 6.2. Let Tq be a flat £'/3-unification problem. We define T := Fq and Sx ■= 
for all variables X occurring in Fq. None of these variables is labeled as finished. 
As long as F contains an unsolved equation, do the following: 

(1) If the Eager-Assignment rule applies to some equation e, then apply it to this equation 
(see Figured]). 

(2) Otherwise, let e be an unsolved equation and A an unsolved atom in e. If neither 
of the rules Decomposition (see Figure [2|) and Extension (see Figure [3|) applies to A 
in e, then return "fail." If one of these rules applies to A in e, then (don't know) 
non-deterministically choose one of these rules and apply it. 

Once all equations of F are solved, return the substitution a that is induced by the current 
assignment. 

The Eager- Assignment rule is described in Figure [TJ Note that, after a non- failing 
application of this rule, the equation it was applied to is solved since the expansion of this 
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The L-variant of the Decomposition rule applies to the unsolved atom A in the equa- 
tion e if 

• ^ G LAto{e) \ RAto{e); 

• yl is of the form A = 3r.C; 

• there is at least one atom of the form 3r.B E RAto{e). 

Its application chooses (don't know) non-deterministically an atom of the form 3 r.B € 
RAto{e) and 

• adds 3r.C to RAto{e)\ 

• creates a new equation C n B ='■ B and expands it w.r.t. the assignments of all 
variables contained in this equation, unless this equation has already been generated 
before. If the equation has already been generated before, it is not generated again. 

Figure 2: The Decomposition rule in its L-variant. The R-variant is obtained by exchanging 
the roles of the two sides of the equation. 

equation w.r.t. the new assignment for X adds all elements of RAto{e) to LAto{e). As an 
example, consider the equations 

y=-T, Z=-3r.T, XUY =■ 

and assume that Sx = Sy = Sz = % and none of the three variables X, Y, Z is finished. 
An application of the Eager-Assignment rule to the first equation labels Y as finished, but 
does not change anything else. The subsequent application of the Eager-Assignment rule 
to the second equation changes the assignment for Z to Sz = {3r.T}, labels Z as finished, 
and expands the second and the third equation w.r.t. the new assignment for Z. Thus, we 
now have the equations 

y=-T, Zn3r.T =■ 3r.T, xny=-zn3r.T. 

Since Y, Z are finished, the Eager- Assignment rule can now be applied to the third equation. 
This changes the assignment for X to Sx = {3r.T}, labels X as finished, and adds 3r.~T 
to the left-hand side of the third equation. Now all equations are solved. The current 
assignment induces a substitution a with (t{X) = 3r.T = cr(Z) and cr{Y) = T, which is a 
unifier of the original set of equations. 

The Decomposition rule is described in Figure [2l This rule solves the unsolved atom 
A = 3 r.C by adding it to the other side. For this to be admissible, one needs a more specific 
atom 3 r.B on that side, where the "more specific" is meant to hold after application of the 
unifier. Thus, to ensure that the unifier a computed by the algorithm satisfies a{3r.B) C 
a{3r.C), the rule adds the new equation C n B =■ B. Obviously, if the substitution a 
solves this equation, then it satisfies a{B) C o"(C), and thus a(3r.B) C a{3r.C). As an 
example, consider the equation 

3r.X n3r.A 3r.A, 

and assume that Sx = and that X is unfinished. An application of the L-variant of the 
Decomposition rule to this equation adds 3 r.X to the right-hand side of this equation, and 
thus solves it. In addition, it generates the new equation X n A A, which is solved. 
The current assignment induces a substitution a with cr{X) = T, which solves the original 
equation. 
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The L-variant of the Extension rule apphes to the unsolved atom A of the equation e 
if 

• Ae LAto{e) \ RAto{e); 

• there is at least one unfinished variable X € RAto{e) 

Its application chooses (don't know) non-deterministically an unfinished variable X € 
RAto{e) and adds A to Sx- 

• If this makes the current assignment cyclic, then return "fail." 

• Otherwise, expand all equations containing X w.r.t. the new 
assignment for X. 

Figure 3: The Extension rule in its L-variant. The R- variant is obtained by exchanging the 
roles of the two sides of the equation. 

The Extension rule is described in Figure O Basically, this rule solves the unsolved 
atom A by extending with this atom the assignment of an unfinished variable contained in 
the other side of the equation. As an example, consider the equation 

AnBr.T =• 3r.TnX, 

where A is a concept constant, Sx = 0, and X is unfinished. An application of the Extension 
rule to A in this equation extends the assignment for X to Sx = {A}, and expands this 
equation by adding A to the right-hand side. The equation obtained this way is solved. The 
substitution a induced by the current assignment replaces X by A, and solves the original 
equation. 

Theorem 6.3. Alaorithm \6.2\ is an NP-algorithm for testing solvability of flat EC-unifica- 
tion problems. 

First, we show that the algorithm is indeed an NP-algorithm. For this, we consider 
all runs of the algorithm, where for every (don't care) non-deterministic choice exactly one 
alternative is taken. Since a single rule application can obviously be realized in polynomial 
time, it is sufficient to show the following lemma. 

Lemma 6.4 (Termination). Every run of the algorithm terminates after a polynomial num- 
ber of rule applications. 

Proof. Each application of the Eager- Assignment rule finishes an unfinished variable. Thus, 
since finished variables never become unfinished again, it can only be applied k times, where 
k is the number of variables occurring in the input unification problem Fq. This number is 
clearly linearly bounded by the size of Fq. 

Every application of the Decomposition rule or the Extension rule turns an unsolved 
atom in an equation into a solved one, and a solved atom in an equation never becomes 
unsolved again in this equation. For a fixed equation, in the worst case every atom of Fq 
may become an unsolved atom of the equation that needs to be solved. There is, however, 
only a linear number of atoms of Fq. Each equation considered during the run of the 
algorithm is either descended from an original equation of Fq, or from an equation of the 
form C n B =^ B for atoms 3r.B and 3r.C of Fq. Thus, the number of equations is also 
polynomially bounded by the size of Fq. Overall, this shows that the Decomposition rule 
and the Extension rule can only be applied a polynomial number of times. □ 
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Next, we show soundness of Algorithm 16.21 We call a run of this algorithm non-failing 
if it terminates with a unification problem containing only solved equations. 

Lemma 6.5 (Soundness). Let Fq he a flat EC-unification problem. The substitution a 
returned after a successful run of Alaorithm \6.^ on input Tq is an £C-unifier ofTQ. 

Proof. First, note that the rules employed by Algorithm 16.21 indeed preserve the two invari- 
ants mentioned before: 

(1) the current assignment is always acyclic; 

(2) all equations are expanded. 

In fact, whenever the current assignment is extended, the rules test acyclicity (and return 
"fail," if it is not satisfied). In addition, they expand all equations w.r.t. the new assignment. 

Now, assume that the run of the algorithm has terminated with the £" /^-unification 
problem T, in which all equations are solved. The first invariant ensures that the final 
assignment constructed by the run is acyclic, and thus indeed induces a substitution a. 
Because of the second invariant, Lemma |6 . II applies . and thus we know that a is a solution 
off. 

It remains to show that the substitution a is also a solution of the input problem Tq. To 
this purpose, we take all the equations that were considered during the run of the algorithm, 
i.e., present in Tq or in any of the other unification problems generated during the run. Let 
£ denote the set of these equations. We define the relation — )• on £" as follows: e — )• e' 
if e was transformed into e' using one of the rules of Algorithm 16.21 To be more precise, 
the Eager- Assignment rule transforms equations containing X from the current unification 
problem F by expanding them w.r.t. the new assignment for X. The same is true for the 
Extension rule. The decomposition rule transforms an equation e containing the unsolved 
atom A = 3 r.C by adding this atom to the other side, which needs to contain an atom 
of the form 3r.B. For this new equation e', we have e e' . The decomposition rule 
may also generate a new equation e" of the form C n B =■ B (\i this equation was not 
generated before). However, we do not view this equation as a successor of e w.r.t. — >, i.e., 
we do not have e e". Equations C n B =^ B that are generated by an application of 
the decomposition rule are called D- equations. Equations that are elements of the input 
problem Fq are called I- equations. Any equation e' that is not an I-equation or a D-equation 
has a unique predecessor w.r.t. — i.e., there is an equation e G £" such that e e' . 

Starting with the set := F we will now step by step extend J-" by a predecessor of 
an equation in until no new predecessors can be added. Since £ is finite, this process 
terminates after a finite number of steps. After termination we have £ = and thus in 
particular Fq C J^. This is due to the fact that, for every element eo of there are n > 
elements ei, . . . , e„ S £ such that eo — > ei ^ . . . ^ and € F. Thus, it is enough to 
show that the set J- satisfies the following invariant: 

(*) the substitution a solves every equation in T . 

Since o" is a solution of F, this invariant is initially satisfied. To prove that it is preserved 
under adding predecessors of equations in T ^ we start with the equations of minimal role 
depth. To be more precise, if the equation e is of the form C = ■ D, we define the role depth 
of e w.r.t. a to be the role deptlFl of the concept term a{C) □ cr(D). The strict order >- on 
£ is defined as follows: e e' iff the role depth of e w.r.t. a is larger than the role depth 



see the proof of Proposition 13.51 for a definition. 
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of e' w.r.t. cr. We write e &' if e and e' have the same role depth w.r.t. a. The following 
is an easy consequence of the definition of a and of our rules: 

(**) ei ->■ 62 ^ ...-)■ e„ implies ei R^o- 62 ~ct • • • ~o- Cn- 

Assume that we have already constructed a set F such that the invariant (*) is satisfied. 

Let e' be an equation in F such that 

• there is an e G £^ \ with e — > e'; 

• e' is of minimal role depth with this property, i.e., if /' G J" is such that e' >~ f and /' 
has a predecessor / w.r.t. then j ^T. 

If no such equation e' exists, then we are finished, and we have E = T . Otherwise, let e' 
be such an equation and e its predecessor w.r.t. — We add e to T . In order to show that 
the invariant (*) is still satisfied, we make a case distinction according to which rule was 
applied to e to produce e': 

(1) Eager- Assignment. By an application of this rule, the assignment for X is modified 
from 5^ = to 5x = {^1, • • • , ^n}, where Ax,..., A n are non-variable atoms. In 
addition, X is labeled as finished. Since the assignment of a finished variable cannot be 
changed anymore, we know that we also have Sx = {^1, • • • , A^} in the final assignment, 
and thus <j{X) = cr(^i) □ . . . □ (t(A„). The rule modifies equations as follows: all 
equations containing X are expanded w.r.t. the assignment Sx = {^1, • ■ • Since 
e is transformed into e' using this rule, it must contain X. We assume for the sake 
of simplicity that X is contained in the left-hand side of e, but not in the right-hand 
side, i.e., e is of the form C U X D and the new equation e' G V obtained from e is 
CnXnAiU. . .nAn =^ D. Since a solves e', we have cr(D) = a{Cr\Xr\Axr\. . .UA^) = 
a{C)na{Ai)n. . .n(j{An)na{Ai)n. . .na{An) = a{C)na{Ai)n. . .na{An) = (7(CnX), 
which shows that a also solves e. 

(2) Decomposition. Without loss of generality, we consider the L-variant of this rule. Thus, 
the equation e is of the form Z) □ 3 r.C = □ 3 r.B, and e' is obtained from e by adding 
3r.C to the right-hand side, i.e., e' is of the form D □ 3 r.C =^ E U 3r.B □ 3 r.C We 
know that a solves e' . Thus, if we can show o'{B) C c((7), then we have a{D □ 3 r.C) = 
a{E) n (T(3r.Z?) □ c7(3r.C) = (t{E) □ a{3r.B), which shows that a solves e. 

Consequently, it is sufficient to prove a{B) C cf{C). The Decomposition rule also 
generates the equation C U B =^ B and expands it w.r.t. the assignments of all the 
variables contained in this equation, unless this equation has already been generated 
before. Thus, either this application or a previous one of the Decomposition rule has 
generated the equation CuB =^ B, and then expanded it (w.r.t. the current assignment 
at that time) to an equation ei. Since atoms are never removed from an assignment, the 
atoms present in the assignment at the time when the Decomposition rule generated 
the equation C H B =^ B are also present in the final assignment used to define the 
substitution a. Thus, if we can show that a solves ei, then we have also shown that a 
solves C n B =^ B, and thus satisfies (t{B) C (t{C). 

Since equations are never completely removed by our rules, but only modified, there 
is a sequence of equations ei —)■ 62 —)■... ^ e„ such that e„ G T. Property (**) thus 
yields ei ~ct ^2 ■ ■ ■ &n- In addition, the role depth C H B =^ B w.r.t. a is the 
same as the role depth of ei w.r.t. a. Consequently, we have e' >- for alH, 1 < i < n. 
Now, assume that e\ T. Then there is an z > 1 such that G but ej_i E £\J^. 
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This contradicts our assumption that e' is minimal. Thus, we have shown that ei G J-, 
and this impHes that a solves ei. 

Overall, this finishes the proof that a solves e. 
(3) Extension. By an application of this rule, the assignment for X is modified by adding 
a non- variable atom A to it. Since atoms are never removed from an assignment, we 
know that we also have A G Sx in the final assignment, and thus (j{X) C (t[A). 
The rule modifies equations as follows: all equations containing X are expanded w.r.t. 
the new assignment for X. Since e is transformed into e' using this rule, it must 
contain X. We assume for the sake of simplicity that X is contained in the left-hand 
side of e, but not in the right-hand side, i.e., e is of the form C n X =■ D and the 
new equation e' obtained from e \s C U X U A =' D. Since a solves e', we have 
a{D) = a{CnXnA)= a{C) U a{X) U a{A) = a{C) n a{X) = a{C n X), which shows 
that fj also solves e. 

To sum up, we have shown that the invariant (*) is still satisfied after adding e to J-. This 
completes the proof of soundness of our procedure. □ 

It remains to show completeness of Algorithm l6.2i Thus, assume that the input unifica- 
tion problem Tq is solvable. Proposition 15.41 tells us that Fq then has an is-minimal reduced 
ground unifier 7, and Proposition 15.111 implies that, for every variable X occurring in Tq, 
there is a set S]^ of non- variable atoms of Tq such that 

7(x)^7(n^i), 

where, for a set of non-variable atoms S of Tq, the expression PIS' denotes the conjunction 
of the elements of S (where the empty conjunction is T). 

Lemma 6.6 (Completeness). Let Tq be a flat EC-unification problem, and assume that 7 is 
an is-minimal reduced ground unifier oJTq. Then there is a successful run of Alaorithm \6.^ 
on input Tq that returns a unifier a that is equivalent to 7, i.e., satisfies cr{X) = 'y{X) for 
all variables X occurring in Tq. 

Proof. The algorithm starts with F := Fq and the initial assignment Sx '■= for all variables 
X occurring in Fq. It then applies rules that change F and the current assignment as long 
as the problem F contains an unsolved equation. 

We use 7 to guide the (don't know) non-deterministic choices to be made during the 
algorithm. We show that this ensures that the run of the algorithm generated this way does 
not fail and that the following invariants are satisfied throughout this run: 

(11) 7 is a unifier of F; 

(12) for all atoms B E Sx there exists an atom A € Sj^ such that 7(A) C 7(-B); 
(/a) for all finished variables X we have 'y{X) = j(\~\Sx)- 

Before constructing a run that satisfies these invariants, let us point out two interesting 
consequences that they have: 

(Ci) The current assignment is always acyclic. In fact, if X directly depends on Y, then 
there is an atom B G Sx that has the form B = 3 r.Y for some role name r. Invariant 
I2 then implies that there is an ^4 G 5^ such that ^{X) C 7(A) ^ ^{B) = 3r.^iY). 
Thus, if X depends on X, then there are k > \ role names ri,...,rfc such that 
7(X) C 3ri. • • • 3rfc.7(X), which is impossible. 
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(C2) For each variable X occurring in Tq, we have "y{X) C cr{X), where a is the current 
substitution induced by the current assignment. This is again a consequence of invari- 
ant l2- Indeed, recall that the fact that the current assignment is acyclic implies that 
there is a strict linear order > on the variables occurring in T such that X > Y ii 
X depends on Y. The current substitution a is defined along this order. We prove 
'y{X) C a{X) by induction on this order. 

Consider the least variable X. If Sx = 0, then cr{X) = T, and thus ^{X) C cr{X) 
is trivially satisfied. Otherwise, we know, for every B G Sx, that it does not contain 
any variables, which implies that (t{B) = B = "){B) □ "f{A) for some atom A € 5^. 
Obviously, this yields a{X) = (t{[]Sx) □ liU^x) = li^)- 

Now, assume that ^iX) E <7{X) holds for all variables Y < X. Since the concept 
constructors of £C are monotone w.r.t. subsumption, this implies 7(C) 1^ a[C) for 
all concept terms C containing only variables smaller than X. If Sx is empty, then 
(t{X) = T □ ^{X) is trivially satisfied. Otherwise, we know, for every B G Sx, that 
it contains only variables smaller than X. This yields cr[B) □ ^{B) □ 7(^) for some 
atom A € S\. Again, this implies a{X) = a{[]Sx) □ liU^x) = l{X). 
Since 7 was assumed to be an is-minimal unifier of Fq, the consequence C2 implies that cr 
can only be a unifier of Fq if a is equivalent to 7. If the run has terminated successfully, 
then the final substitution a obtained by the run is a unifier of Fq (due to soundness). Thus, 
in this case the computed unifier a is indeed equivalent to 7. Consequently, to prove the 
lemma, it is sufficient to construct a non-failing run of the algorithm that satisfies the above 
invariants. 

The invariants are initially satisfied since 7 is a unifier of Fq, the initial assignment 
for all variables X occurring in Fq is Sx = 0, and there are no finished variables. Now, 
assume that, by application of the rules of Algorithm 16.21 we have constructed a unification 
problem F and a current assignment such that the invariants are satisfied. 

(1) If all equations in F are solved, then the run terminates successfully, and we are done. 

(2) If there is an unsolved equation to which the Eager-Assignment rule applies, then the 
algorithm picks such an equation e and applies this rule to it. Without loss of generality, 
we assume that the L-variant of the rule is applied. The selected equation e is of the 
form 

X n Zi n . . . n Zfc =• Ai n . . . n yl„ n Yi n . . . n y„, 

where Ai, . . . , An are non- variable atoms, and Yi, . . . , Ym, Zi, . . . are finished vari- 
ables. Because the left-hand side of the equation does not contain any non-variable 
atoms, we know that Sx = Szi = . . . = Sz,. = (since the algorithm keeps all equa- 
tions expanded). Since Zi, . . . ,Zk are finished, we thus have ^{Z\) = . . . = j{Zk) = T 
(by invariant I^). We also know that S'y. C {^i, . . . , An} for alH, 1 < i < m. Since the 
variables Yi are finished, invariant implies that 7(1^) 3 7(^1) n . . . □ 7(j4„). 

The new assignment for X is Sx = {Ai, . . . , An}, all equations containing X are 
expanded w.r.t. this assignment, and X becomes a finished variable. First, we show 
that I3 is satisfied. Nothing has changed for the variables that were already finished 
before the application of the rule. However, X is now also finished. Thus, we must 
show that 7(X) = 7(P|5x)- We know that 7 solves the equation e (by Ii). This yields 

7(X) = 7(X) n 7(Zi) n . . . n ^{Zk) = 7(^1) n . . . n j{An) n jiY^) n . . . n 7(y„) = 

7(j4i) n . . . n 7(A„) = j{\~\Sx)- Regarding I2, the only assignment that was changed 
is the one for X. Since the new assignment for X is Sx = {^1, • • • , An}, and we have 
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already shown that ^{X) = 7(^1) □ . . . n^[An), the invariant I2 holds by Corollary 13. 21 
Note that this also implies that the new assignment is acyclic, and thus the application 
of the Eager- Assignment rule does not fail. Finally, consider the invariant Ii. The rule 
application modifies equations containing X by adding the atoms j4i,...,A„. Since 
7(X) = 7(Ai)n. . .n7(^ n)i an equation that was solved by 7 before this modification, is 
also solved by 7 after this modification. To sum up, we have shown that the application 
of the Eager- Assignment rule does not fail and preserves the invariants. 
(3) If there is no unsolved equation to which the Eager- Assignment rule applies, then 
the algorithm picks an unsolved equation e and an unsolved atom A occurring in this 
equation. We must show that we can apply either the Decomposition or the Extension 
rule to A m. e such that the invariants stay satisfied. Without loss of generality, we 
assume that the unsolved atom A occurs on the left-hand side of the equation e. 
(a) First, assume that A is an existential restriction A = 3r.C. The selected unsolved 
equation e is thus of the form 

3r.C n Ai n . . . n Am = Bi n . . . n Bn, 

where Ai, . . . , A^, and Bi, . . . , Bn are (variable or non- variable) atoms and 3 r.C 
{Bi, . . . ,Bn}- Since 7 solves this equation (by invariant /i). Corollary 13.21 implies 
that there must be an i, 1 < z < n, such that j{Bi) C 3r.j{C). 

(i) If Bi is an existential restriction Bi = 3r.B, then we have 7(-B) E 7{C)- 
We apply the Decomposition rule to A and Bi. The application of this rule 
modifies the equation e to an equation e' by adding the atom A to the right- 
hand side. In addition, it generates the equation C U B =^ B and expands it 
w.r.t. the assignments of all variables contained in this equation (unless this 
equation has been generated before). After the application of this rule, the 
invariants I2 and /s are still satisfied since the current assignments and the set 
of finished variables remain unchanged. Regarding invariant /i, since 7 solves 
e, it obviously also solves e' due to the fact that 7(i?i) ^ 7(^) and Bi is a 
conjunct on the right-hand side of e. In addition, 7(^6) E liC) implies that 7 
also solves the equation CnB =• B. Since invariant I2 is satisfied, this implies 
that 7 also solves the equation obtained from C V^B =■ -B by expanding it 
w.r.t. the assignments of all variables contained in it. 

(ii) Assume that there is no 1 < i < n, such that Bi is an existential restriction 
satisfying jiBi) C 7(^4). Thus, if Bi is such that 'y{Bi) C 7(^), then we know 
that Bi = X \s a, variable. We want to apply the Extension rule to A and X. 
To be able to do this, we must first show that X is not a finished variable. 
Thus, assume that X is finished, and let Sx = {Ci, . . . ,0^}. Invariant 
yields 7(Ci) n . . . n -f{Ci) = 7(X) = -f{Bi) C -f{A) = 3 r.7(C), and thus there 
is a j,l < j < i, such that jiCj) C li-A). Since A is an existential restriction, 
the non-variable atom Cj must also be an existential restriction, and since the 
equation e is expanded, Cj E Sx occurs on the right-hand side of this equation. 
This contradicts our assumption that there is no such existential restriction 
on the right-hand side. Thus, we have shown that X is not finished, which 
means that we can apply the Extension rule to A and X. 

The application of this rule adds the atom A to the assignment for X, and 
it expands all equations containing X w.r.t. this new assignment, i.e., it adds 
A to the left-hand side and/or right-hand side of an equation whenever X is 
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contained in this side. Since we know that j{X) Q 7(^), it is easy to see 
that, if 7 solves an equation before this expansion, it also solves it after the 
expansion. Thus invariant Ii is satisfied. Invariant I2 also remains satisfied. 
In fact, if Si = {Di, . . .,0^}, then 7(1)1) n . . . n -f{Dk) = jiX) C liA) 
implies that there is a j, 1 < j < £, such that ^{Dj) C 7(^)- The fact that 
I2 is satisfied by the new assignment also implies that this new assignment is 
acyclic, and thus the application of the Extension rule does not fail. Invariant 
/s is still satisfied since X is not finished, and the assignments of variables 
different from X were not changed, 
(b) Second, assume that A is a concept name. The selected unsolved equation e is thus 
of the form 

An AiH . . Am = BiH . . .nBn, 
where j4i, . . . , Am and -Bi, . . . , are (variable or non- variable) atoms, and A 
{Bi, . . . ,Bn}- Since 7 solves this equation (by invariant Ii), Corollary 13.21 implies 
that there must be an z, 1 < z < n, such that ^{Bi) C ^{A) = A. Since A ^ 
{Bi, . . . , i?„}, we know that Bi = X \s a, variable. We want to apply the Extension 
rule to A and X. To be able to do this, we must first show that X is not a finished 
variable. 

Thus, assume that X is finished, and let Sx = {C*!, ■ ■ ■ ^Cf}. Invariant yields 
7(Ci) n . . . n 7(Q) = 7(X) = -f{Bi) C 7(yl) = A, and thus there is a j, 1 < j < 
such that "y{Cj) Q A. Since ^ is a concept name, the non-variable atom Cj must 
actually be equal to A, and since the equation e is expanded, Cj = A € Sx occurs 
on the right-hand side of this equation. This contradicts our assumption that A 
is an unsolved atom. Thus, we have shown that X is not finished, which means 
that we can apply the Extension rule to A and X. The application of this rule 
adds the atom A to the assignment for X, and it expands all equations containing 
X w.r.t. this new assignment. The proof that this rule application does not fail 
and preserves the invariants is identical to the one for the case where A was an 
existential restriction. □ 

To sum up, we have shown that Algorithm 16.21 always terminates (in non-deterministic 
polynomial time) and that it is sound and complete. This finishes the proof of Theorem l6.31 

7. Unification in semilattices with monotone operators 

Unification problems and their types were originally not introduced for Description 
Logics, but for equational theories [16]. In this section, we show that the above results for 
unification in £C can actually be viewed as results for an equational theory. As shown in 
[28j , the equivalence problem for SC-concept terms corresponds to the word problem for the 
equational theory of semilattices with monotone operators. In order to define this theory, 
we consider a signature SsLmO consisting of a binary function symbol A, a constant symbol 
1, and finitely many unary function symbols fi, ■ ■ ■ , fn- Terms can then be built using these 
symbols and additional variable symbols and free constant symbols. 
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Definition 7.1. The equational theory of semilattices with monotone operators is defined 
by the following identities: 

SLmO := {x A {y A z) = (x A y) A z, x Ay = y Ax, x Ax = x, x A 1 = x} U 

{U{x Ay) A fi{y) = h{x A y) \ 1 < i < n} 

A given £^>C-concept term C using only roles ri , . . . , r„ can be translated into a term 
tc over the signature T,sLmO by replacing each concept constant ^4 by a corresponding free 
constant a, each concept variable X by a corresponding variable x, T by 1, □ by A, and 3 rj 
by fi. For example, the £'>C-concept term C = An 3ri.T n 3r3.{X □ B) is translated into 
tc = a A fi{l) A /^{x Ab) . Conversely, any term over the signature ^sLmO can be translated 
back into an if >C-concept term. 

Lemma 7.2. Let C,D be SC-concept term using only roles ri,...,r„. Then C = D iff 

tc =SLmO to- 

As an immediate consequence of this lemma, we have that unification in the DL £C 
corresponds to unification modulo the equational theory SLmO. Thus, Theorem 14.11 implies 
that SLmO has unification type zero, and Theorem 15.141 implies that S'Lm 0-unification is 
NP-complete. 

Corollary 7.3. The equational theory SLmO of semilattices with monotone operators has 
unification type zero, and deciding solvability of an SLmO -unification problem is an NP- 
complete problem. 

Since the unification problem introduced in Theorem 14.11 contains only one role r, this 
is already true in the presence of a single monotone operator. 

8. Conclusion 

In this paper, we have shown that unification in the DL £C is of type zero and NP- 
complete. There are interesting differences between the behavior of £C and the closely 
related DL TCq w.r.t. unification and matching. Though the unification types coincide for 
these two DLs, the complexities of the decision problems differ: J-'>Co-unification is ExpTime- 
complete, and thus considerably harder than £" /^-unification. In contrast, J-'/3o-™atching is 
polynomial, and thus considerably easier than £^>C-matching, which is NP-complete. In 
addition to showing the complexity upper bound for <5£-unification by a simple "guess and 
then test" NP-algorithm, we have also developed a more goal-oriented NP-algorithm that 
makes (don't know) non-deterministic decisions (i.e., ones that require backtracking) only 
if they are triggered by unsolved atoms in the unification problem. 

As future work, we will consider also unification of concept terms for other members 
of the f/3-family of DLs [.5j. In addition, we will investigate unification modulo more 
expressive terminological formalisms. On the practical side, we will optimize and implement 
the goal-oriented £'>C-unification algorithm developed in Section [6l We intend to test the 
usefulness of this algorithm for the purpose on finding redundancies in £'£-based ontologies 
by considering extensions of the medical ontology Snomed ct. For example, in |18j, two 
different extensions of Snomed ct by so-called post-coordinated concepts were considered. 
The authors used an (incomplete) equivalence test to find out how large the overlap between 
the two extensions is (i.e., how many of the new concepts belonged to both extensions). 
As pointed out in the introduction, the equivalence test cannot deal with situations where 
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different knowledge engineers use different names for concepts, or model on different levels 
of granularity. We want to find out whether using unifiability rather than equivalence finds 
more cases of overlapping concepts. Of course, in the case of unification one may also obtain 
false positives, i.e., pairs of concepts that are unifiable, but are not meant to represent the 
same (intuitive) concept. It is also important to find out how often this happens. Another 
problem to be dealt with in this application is the development of heuristics for choosing 
the pairs of concepts to be tested for unifiability and for deciding which concept names are 
turned into variables. 
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